Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 1 February 2008 (MN MfeK style file vl.4) 



On the streaming motions of haloes and galaxies 



Ravi K. Sheth 1 , Antonaldo Diaferio 2 , Lam Hui 3 ' 4 & Roman Scoccimarro 4 

1 NASA/Fermilab Astrophysics Group, MS 209, Batavia, IL 60510-0500 

2 Dipartimento di Fisica Generale "Amedeo Avogadro" , Universitd di Torino, Italy 

3 Department of Physics, Columbia University, 538 West 120th Street, New York, NY 10027 

4 Institute for Advanced Study, School of Natural Sciences, Einstein Drive, Princeton, NJ 08540 
Email: sheth@fnal.gov, diaferio@ph.unito.it, scoccima@ias.edu, lhui@ias.edu 



o 
o 
o 

(N 
-t— > 
o 

O 

00 
> 

m 



O 
O 

S3 

Oh 
6 



X 



2000 October 5 



ABSTRACT 

A simple model of how objects of different masses stream towards each other as they 
cluster gravitationally is described. The model shows how the mean streaming velocity 
of dark matter particles is related to the motions of the parent dark matter haloes. It 
also provides a reasonably accurate description of how the pairwise velocity dispersion 
of dark matter particles differs from that of the parent haloes. The analysis is then 
extended to describe the streaming motions of galaxies. This shows explicitly that 
the streaming motions measured in a given galaxy sample depend on how the sample 
was selected, and shows how to account for this dependence on sample selection. In 
addition, we show that the pairwise dispersion should also depend on sample type. 
Our model predicts that, on small scales, redshift space distortions should affect red 
galaxies more strongly than blue. 

Key words: galaxies: clustering - cosmology: theory - dark matter. 



1 INTRODUCTION 

Gravity makes objects cluster. Therefore, the motions of ob- 
jects towards each other may provide information about the 
background cosmology. Of course, different subsets of the 
clustering particles may trace the underlying streaming mo- 
tions differently. The scale dependence of the mean stream- 
ing vf™(r) of dark matter particles has been understood for 
some time now (Hamilton et al. 1991; Nityananda & Pad- 
manabhan 1994). But there has been little study of how this 
statistic depends on trace-particle type. 

To do this, we build a model in which gravitational 
clustering is viewed as the combination of two processes. 
The first arises from the fact that gravity causes matter to 
stream towards local minima of the gravitational potential. 
This requires a model of how matter which was initially 
distributed rather smoothly around the centre of collapse 
becomes redistributed into a more centrally concentrated 
density profile as the collapse proceeds. The second process 
is that these centres around which local collapses are oc- 
curring, these clusters, are themselves moving towards each 
other: clusters cluster. It is the combination of these two 
types of motions which gives rise to the spatial distribution 
and streaming motions of objects today. 



Section 2.1 summarizes useful results which follow from 



linear theory. Section 2.2 shows how the streaming motions 
of collapsed dark matter haloes depend on halo mass. Sec- 
tion 2.3 uses this to model the streaming motions of parti- 



cles, rather than haloes. It shows what fraction of a particle's 
streaming motion arises from the motion of its parent halo, 
and what fraction must arise from motions within the halo. 
These smaller scale motions are essentially a consequence of 
the collapse around the halo centre we referred to earlier. 
It then presents measurements from numerical simulations 
which show that the model predictions are reasonably ac- 
curate. It also shows that the model provides a reasonable 
description of how the second moment of the pairwise ve- 
locity distribution of the dark matter differs from that of 
haloes. 

Section |^ shows how to extend the model to study the 
mean streaming motions and the pairwise velocity disper- 
sion of galaxies and presents measurements from semian- 
alytic galaxy formation simulations which show that the 
model predictions are reasonably accurate. Section ^ dis- 
cusses what this model implies if one wishes to use measure- 
ments of the streaming motions of galaxies to make infer- 
ences about cosmology. 



2 THE MODEL 

2.1 The mean streaming velocity 

We will begin by reviewing the strategy which led to the 
derivation of how vf™ (r) depends on scale. The relevant 
starting equation is the pair conservation equation in Pee- 
bles' book (Peebles 1980), but we will start with the equa- 
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tion in the form presented by Nityananda . 
(1994): 



d(l + Q = Mr) 
d In a Hr 



Padmanabhan 



(1) 



where £(r, a) is the volume averaged correlation function on 
proper (rather than comoving) scale r at the time when the 
expansion factor is a, and the Hubble constant is H . This 
says that if we know the correlation function for all scales 
r and all times a, then the assumption that the number of 
pairs is conserved allows us to compute how Vi2(r) depends 
on scale today. 

An approximate solution to this expression can be got 
as follows (Peebles 1980). Assume that £ evolves accord- 
ing to linear theory: £(r,a) = [D(a)/Do] 2 £(r, ao), where 
D(a) is the linear theory growth factor at a, and Do is 
the growth factor at the present time when a = ao- In 
an Einstein de-Sitter cosmology, D(a)/Do = a/ao. Then 
the left hand side is d £(r,a)/d In a = 2 /(fi) £(r, a), where 
/(fi) = d \nD/d lna. So, in this approximation we get 



«ia(r) _ 2/(0) f(r,a) 



Hr 



3 l+£(r,a) 



(2) 



On large scales, £ <C 1, and so this is just the usual linear 
theory expression with an extra factor of (1 + 5) in the de- 
nominator. While this approximation is fine on large scales 
(r > 10 Mpc/h), it underestimates the exact solution by a 
factor of 3/2 or so on smaller scales (Juszkiewicz, Springel 
& Durrer 1998; Sheth et al. 2000). 

Hamilton et al. (1991) showed they could compute a 
good estimate of the evolution of £(r, a), if the initial corre- 
lation function is known (also see Nityananda & Padman- 
abhan 1994). Hamilton et al. also showed that by inserting 
their expression for the evolution of £(r, a) into equation (|l|) 
above, they were able to describe the shape of Via 1 (7") wen 
on all scales. 

While this approach is very useful for studying the 
statistics of dark matter particles, it is not obvious that it 
can be used to estimate the streaming motions of galaxies. 
This is because one usually assumes that galaxies form at 
different times. This means that the number of galaxies is 
not conserved, so the number of galaxy pairs is not con- 
served. This means, for example, that the correlation func- 
tion of galaxies refers to different sets of particles at different 
times. Therefore, there is little reason to expect that insert- 
ing the correlation function of galaxies into the pair conser- 
vation equation should provide a good estimate of vf^(r) 
today. We show below that, provided one makes the correct 
choice of what one uses for £ g ai(r, a), the pair conservation 
equation can be used to provide an accurate estimate of the 
streaming motions of galaxies. 

2.2 The haloes 

This subsection is concerned with the first moment of the 
pairwise velocity distribution of haloes identified at the 
present time. Every halo will be represented by one particle, 
say, the one at the halo centre of mass today. Imagine trac- 
ing these centre-of-mass particles back in time. By definition 
the number of these particles is conserved, since all we're 
doing is following them back to high redshift. Of course, at 
high redshift, few if any of the haloes would actually have 



collapsed around these centre-of-mass particles. Neverthe- 
less, we will use the motions of these particles to represent 
the motions of the halo centre of mass. Peebles' pair con- 
servation equation, combined with the assumption that the 
motion of a halo today is the same as that of its associated 
centre-of-mass particle, says that if we knew £(r, a) for these 
tracer particles, then we can compute Wif lo ('") today. 

So, to compute «i2 l0 , we are stuck with the problem of 
studying the spatial distribution (i.e., the bias factor) of a 
special marked set of particles at earlier times. The case in 
which the marked particles (in this case, the halo centres- 
of-mass) are observed at a later epoch than when they were 
marked is familiar: e.g. this is like the Mo & White (1996) 
simple model for galaxies, in which galaxies formed in haloes 
at z = 3 but we only observe them today. Here, we are 
interested in the spatial distribution of the special particles 
at earlier epochs than when they were marked. 

The halo centre-of-mass particles are biased tracers of 
the dark matter distribution. The large scale bias factor is 
the square root of the ratio of the correlation function of 
these particles to that of the dark matter correlation func- 
tion on large scales. It depends on halo mass: 



Chh(r) » & 2 (m)£dm(7-), 



(3) 



and a similar equality holds for fhh(r). Here Cdm ( r ) denotes 
the initial correlation function of the dark matter extrapo- 
lated using linear theory to the present time. In equation Q 
we use the linearly extrapolated £dm( r ) rather than the 
present day £(r) as a practical way of taking into account 
the volume exclusion effects of haloes at small scales. See 
Section 2 in Sheth et al. (2000) for a detailed discussion. 
To a good approximation, 



b(m) = 1 + 



S c0 D{a)/D ' 



(4) 



where v(ra) = 5 c o/cr{rn) is a function which increases with 
decreasing halo mass, and D(a) and Do were defined ear- 
lier. At the present time, D(a) — Do and this is the familiar 
Eulerian bias formula from Mo & White (1996). The La- 
grangian bias factor is usually expressed as the ratio of £hh 
at the initial time to the linearly extrapolated (J^. This 
means that the Lagrangian bias factor is 



&La g (m) 



£ hh (r) D(a t ) _ D{a t ) v 2 {m) 



&) Do 



Do 



+ ■ 



5co 



where a,i denotes the expansion factor at the initial time. 
Since a,i <C do, &La g — > iy 2 ~ l)/5 c o, which is another familiar 
expression from Mo & White (1996). So, in this approxima- 
tion, 



d b(m) 
d In a 



/(«) 



1 - b(m) 



(5) 



halo 
2 



It is straightforward to insert these expressions for the 
halo correlation function and its evolution into the pair con- 
servation formula (equation |l|) to see how different v\ 
is from vf™. If we study the streaming motions of haloes 
of two different masses, then we must replace b 2 (m) — ■> 
6(mi) b(m,2). This gives 

vi2°(r) _ «2f(r) &i& 2 [l + £dL»] 



Hr 



Hr [l + b!b 2 &)] 
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Figure 1. Mean streaming motions in the GIF simulations. Open circles and crosses show the streaming motions of the haloes and 
the dark matter, respectively. Solid and dashed curves show our predictions for the haloes (equation which should be accurate on 
large scales, and the dark matter (the sum of equations ^| and p~o| ) , which should be accurate on all scales. Dotted lines show the Hubble 
velocity for comparison. 



m&r) [b 1 (l-b 2 )+b 2 (l-b l )} 
3 [l + bib 2 &)] 



(6) 



If we insert the linear evolution approximation for the rela- 
tion between the correlation function and V12 (equation^), 
then this becomes 



Vi2 lo (r) vt?(r) /61+62 



Hr 



Hr 



1 + W*£j£(r)" 



(7) 



Notice that when bi = b 2 = 1, then t^f lo (r) = t>i™(r). Also, 
in the large separation (small f) limit, v l i 2 lo (r) — > (bi+b 2 )/2 
times vf™{r)- So, on average and on large separations, rela- 
tive to the dark matter, massive haloes (bi + b 2 ) > 2 stream 
towards each other whereas less massive haloes (61 +62) < 2 
stream away from each other. This makes some physical 
sense; clusters cluster, so they are moving towards each 
other, whereas smaller clumps are in or at the edges of ex- 
panding voids, so they are separating from each other. This 
linear bias of the streaming velocities at large separation is 
consistent with the linear theory analysis of Fisher et al. 
(1994). 

Notice that vi 2 scales with the sum of the bias factors. 
If one ignored the evolution of the bias factor when using 
equation (^) , one would have concluded that the scaling was 
with the product of the bias factors — including the evolution 
of the bias factor is essential to getting the correct answer. 
Finally, notice that on smaller scales where £j™ > 1, this 
analysis suggests that v\ 2 of less massive haloes should be 
larger than that of the dark matter, with the opposite trend 
being true for massive haloes. Of course, the linear theory 
and linear evolution approximations we used to obtain equa- 
tion (0) are not accurate on small scales. Nevertheless, this 
provides at least some indication of the small scale behaviour 
of the halo streaming motions. 

Fig. [l] compares this model with measurements in the 
SCDM and ACDM GIF (Kauffmann et al. 1999) simulations 
which were run by the Virgo collaboration (Jenkins et al. 
1999) and are now available to the public. The open circles 
show the streaming motions of all the haloes with m > 2 x 
10 11 M e /h and m > 8.4 x 10 11 M e /h in the SCDM and the 
ACDM simulations, and the solid line shows what our model 



predicts. Specifically, it shows 



V? 2 l °(r) 
Hr 



= — / dmi / dm 2 



Hr 



n{mi)n(m 2 )[l + b(mi)b(m 2 )£^™(r)} 



2/(0) & ha io&) 



(8) 



3 i + fcLo^W 

where fihaio = J dmn(m) is the average number density of 
haloes, bhaio = J dmn(m)b(rri) is their average bias factor, 
the weighting factor in the second line is the ratio of the 
number of mi and 7712 halo pairs at r to the total num- 
ber of halo pairs at r, and the final expression follows from 
inserting equation (|?|) for i^if 10 and using equation (Q) for 
V12 . Our model, which we only expect to be accurate on 
large scales because our approximation for the halo correla- 
tion function, equation (w), breaks down on small scales, is 
reasonably accurate down to scales of order a Mpc/h or so. 
For comparison, the dotted curves show the Hubble velocity. 
The crosses show the streaming motions of the dark matter 
particles in the simulations, and the dashed curve shows the 
prediction associated with the model described in the next 
section. 

2.3 The dark matter 

The large scale net streaming motion of the dark matter can 
be got from our expression for the halo motions by integrat- 
ing up the contribution to the streaming motion from pairs 
in different mass haloes, weighting by the fraction of the to- 
tal number of pairs which are in such haloes, and weighting 
by the halo mass function: 



v 



2halo ^ 



12 



r) 



Hr 



dm 2 / dmi 



miti(mi) m 2 n(m 2 ) 
P P 

r^Lin ( „,halo / 



[l + Kmi)Km 2 )C(r)] t^°(r) 



1 + Cdm(r) 



Hr 



\r) 1 + &) 



Hr l+£dm(r)" 



(9) 
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The final equality follows from inserting equation (0) , noting 
that J drnmn(m) = p, and using the fact that the bias 
factors are defined so that J dmmn(m) b(m) = p. 

We can now make two important points. The first is 
that, at large separations, this expression equals Ui^M = 
«i2 alo ( r )i m this regime the streaming motions of the dark 
matter particles are entirely due to the fact that the haloes 
which contain the particles are moving. Moreover, in this 
regime, vf™(r) ~ Vi 2 (r), where v\ 2 is got from equation 
by using the linear theory values of £ and £. The second is 
that this expression exactly equals that in Sheth et al. (2000) 
for the contribution to vi^ir) from particles which are in 
separate haloes (see their eq. 19). This will be important in 
what follows. 

Notice that on smaller scales, £dm( r ) < ^dm(f). In 
this regime the halo motions only account for a fraction 
of V12 1 (r). The remaining contribution to vf^if) must arise 
from the streaming motions of pairs in which both parti- 
cles are in the same halo. This means that the fact that 
our model for halo motions is not accurate on small scales 
will not matter very much for the small scale value of vf™ 
because, on small scales, the fraction of pairs which are in 
separate haloes, and so are affected by this inaccuracy, is 
small. We turn, therefore, to a discussion of the streaming 
motions of pairs in which both particles are in the same halo. 

If haloes are stable, then the streaming motion within 
a halo exactly cancels the Hubble flow: —V\2(r)/Hr = 1. In 
this case, the contribution from pairs which are in the stable 
haloes equals unity times 



dm 



m 2 n(m) X(r\m) 



£lhaloM 



1+UW l+£dm(r)' 



where m 2 \(r\m) denotes the number of pairs at separation r 
which are in the same halo which has mass m; it depends on 
the density profiles of haloes. For all halo shapes of interest, 
this expression approaches unity at very small r, because 
Cdm(r) = Cihaio(r) + Cdm ( r ) ~ £ihaio(r), and £dm(r) > 1 on 
scales which are smaller than a typical halo. So, if stable 
clustering is correct, then —vf^ir^/Hr = 1 on small scales. 
In fact, the mean pairwise velocity on small scales depends 
on the low-mass behaviour of n(m) and X(r\m) — in general, 
there is no guarantee that n(m) and X(r\m) will conspire to 
give stable clustering (Ma & Fry 2000; Sheth et al. 2000). 

In particular, Section 4 of Sheth et al. (2000) shows that 
the small scale term is 



lhalo 
v 12 

Hr 



d f j m 2 n(m,a) 
91na / p 2 

d\nm,(a) [Cihaio(r, a) - 



dyy_ \(y\in.u) 



[l + £(r,a)] 



■ £ihaio(r,a) 



dlna 



3[l + £(r,a)] 



(10) 



where \{r\m,a) is proportional to the number of pairs in 
the same m-halo which have separation r, and n(m, a) is the 
number density of virialized m-haloes at time a. (Our nota- 
tion differs slightly from that in Sheth et al. — they absorbed 
the two factors of m into their definition of A, whereas here 
we have chosen to show these factors explicitly. Hence, our 
m 2 A is their A.) In Sheth et al., n(m,a) depended on time, 
and the halo profile did as well, because virialized haloes 
were assumed to have profiles of the form given by Navarro, 
Frenk & White (1997), and these halo shapes depend both 
on m/m*(a) and on the ratio of the average density within 
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Figure 2. Pairwise dispersions in the GIF simulations. Open cir- 
cles and crosses show the streaming motions of the haloes and 
the dark matter, respectively. Solid and dashed curves show pre- 
dictions for the haloes and the dark matter (from Sheth et al. 
2000). 



the virialized halo to the background density at the time it 
virialized. (Strictly speaking, the expression above assumes 
that all of the time dependence of the halo shape can be 
written as a function of m/m* — see Sheth et al. for details.) 

The sum of equations ([]) and (|lo| ) gives a complete 
description of the streaming motions of dark matter on all 
scales. The dashed curves in Fig. |l| show that this sum pro- 
vides a good description of the dark matter streaming mo- 
tions on all but the smallest scales (see Sheth et al. 2000 for 
a discussion of the discrepancy at the smallest scales) . Now, 
the dashed curves correspond to independently identifying 
haloes at each epoch. On the other hand, we could have 
followed the approach of Section 2.2 — identify or mark the 
haloes today, and then follow them backward in time. With 
this latter approach, the number density of haloes is fixed to 
the value today, n(m, do), but the profile changes from, say a 
tophat to a more centrally concentrated shape. Equation Q 
shows that the contribution to the streaming motions from 
particles in separate haloes is independent of the details of 
how this happens. However, recall that this contribution ex- 
actly equals the two-halo contribution to u"™ worked out 
by Sheth et al. (2000). This means that the one-halo con- 
tributions to V12 must also be the same in both approaches. 
In particular, this means that however the profile changes 
from a tophat to an NFW shape, it must change in just 
such a way that the final answer for the strea ming motions 
of the dark matter particles equals equation (|10[). Indeed, 
we can use this requirement to constrain how the profile 
changes from the initial tophat to the final NFW cusp — the 
Appendix shows a worked example of how to do this. 



2.4 The pairwise velocity dispersion 

So far, we have shown how the mean streaming motions 
of the dark matter and the haloes are related. Sheth et 
al. (2000) discuss how to do this for the second moment 
of the pairwise velocity distribution. They argued that the 
dark matter particles receive substantial nonlinear kicks to 
their initial velocities (essentially, the virial motions within 
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haloes) , whereas the haloes do not (Sheth & Diaferio 2000) . 
As a result the pairwise dispersion of the dark matter should 
be significantly larger than that of the haloes on all scales. 

Fig. ^ compares what their equation (31) predicts with 
the simulations (we refer the reader to their paper for de- 
tails of the model). The open circles and crosses show the 
pairwise dispersion, ai2(r), of the haloes and the dark mat- 
ter respectively, and the dashed and solid curves show the 
model predictions. (We do not show CT12 for the dark mat- 
ter in the SCDM simulations because the simulation box is 
sufficiently small [85Mpc//i] that cosmic variance affects the 
measurement significantly.) The model is reasonably accu- 
rate on large scales, and not accurate on small scales. Sheth 
et al. (2000) discuss why this happens for the dark matter 
(the inaccuracy is due to the simplifing assumptions that 
haloes have no substructure, the pairwise dispersion from 
a single halo is isotropic and independent of the pair posi- 
tion, and the number of pairs in the infall regime around 
haloes is sufficiently small that the use of virial motions to 
model the dispersion from infalling pairs does not not lead 
to a large error). For the haloes, this discrepancy appears 
on scales which are of the order of a typical m» halo and 
smaller. This suggests that the discrepancy almost surely 
arises from using linear theory to model the spatial distri- 
bution and velocities of haloes on scales which are smaller 
than the smoothing scale used to make the model predic- 
tion. Despite the quantitative discrepancies on small scales, 
the model is in qualitative agreement with the simulation: 
the pairwise dispersion of the haloes is substantially smaller 
than that of the dark matter. 



3 GALAXIES 

The previous section showed that the first moment of the 
pairwise velocity distribution of haloes is different from that 
of the dark matter. It showed that massive haloes separated 
by large distances are streaming together more rapidly than 
less massive haloes at the same separation, and that this 
difference scaled with one rather than two powers of the halo 
bias factor. It also showed that the dark matter statistic was 
obtained by weighting the halo statistic by the number of 
dark matter particle pairs per halo. The second moments of 
the pairwise velocity distributions are also different. In this 
case, also, the dark matter statistic is got by weighting by 
the number of particle pairs per halo. However, the pairwise 
dispersion is also sensitive to the fact that virial motions 
within haloes can be substantially higher than the motions 
of the haloes themselves. As a result, the pairwise velocity 
dispersion of dark matter particles is substantially larger 
than that of haloes, on all scales. This section studies what 
these results imply for the pairwise motions of galaxies. 

We will model galaxies as random particles in dark mat- 
ter haloes. That is, the motion of the galaxies is the same as 
that of the dark matter particle with which they are asso- 
ciated. In this sense there is no velocity bias in our model; 
the fact that velocity statistics for the dark matter and the 
galaxies may, nevertheless, be different, arises solely from 
the fact that dark matter statistics weight each halo propor- 
tional to halo mass, whereas galaxy statistics do not. Such 
models for the difference between the statistics of galaxies 
and dark matter particles have received considerable atten- 



tion recently. Seljak (2000), Peacock & Smith (2000) and 
Scoccimarro et al. (2000) have used them to model the spa- 
tial distribution of galaxies, Sheth & Diaferio (2000) describe 
how to model the distribution function of galaxy peculiar 
velocites, and Sheth et al. (2000) describe how to use these 
models to do analytically what Jing, Mo & Borner (1998) 
did numerically in their study of the pairwise velocity dis- 
persion of galaxies. 

Within the context of this model, galaxies are treated 
by setting 



(11) 



where the two terms denote the contribution to the statistic 
from galaxies in the same and in different haloes, respec- 
tively. The second term on the right hand side can be got 
by modifying equation (§1): 



»if al (0 

Hr 



where 



drri2 / dm 



g(mi)n(mi) g(m2)n(m 2 ) 

Pgai Pgai 

[1 + 6(mi)6(m 2 )ffl(f)] v^ lo (r) 



1+CgalW 



Hr 



(12) 



Pgai — 



dmj(m)n(m) 



and 



dm 



(m)n(m) \(r\m) 



+ J dm 2 J dmi 

x b{m 1 )b{rn 2 )^{r 

z-lhalo/ \ . 7 2 >-Liii/ \ 

= 4gai (r) + 6 ga i f dm (r). 



Pgai Pgai 

j(mi)n(mi) g(m2)n(iri2) 

Pgai Pgai 

Lin, 



(13) 



Here g(m) and <?2(m) denote the first and second moments 
of the distribution of the number of galaxies in m-haloes, 
and we set <?2(m) = if g(m) < 1. There are details as- 
sociated with how one treats the central galaxy in a halo, 
but, for the most part, these amount to a small effect (see 
Sheth & Diaferio 2000), so we have ignored them — they add 
complications but no essential change to the logic of our 
argument. 

On scales larger than a few Mpc/ft, Uif al dominates 
over the one-halo contribution. If we assume linear theory 
for the evolution of the two-halo term (equation @) , then we 
can set 2/(fi)£C/3 -> -vf^/Hr times 
equation dl3) reduces to 



dm Mj and then 



,,2gal ( 



Hr 



dm , 
«12 1 



r) 



Hr 



"gai 



Cdm ( 



1+^galW 



(14) 



This shows that, on large scales, the streaming motions of 
galaxies can be biased relative to the dark matter. The ex- 
tent to which they are biased is related to how differently 
they are clustered, and this, in turn, depends on the g(m) 
relation. 

On smaller scales, the streaming motions are dominated 
by galaxy pairs in which both members are in the same halo. 
A little thought shows that this can be computed simply by 
setting p — > pgai and m 2 — > g2 (m) in equation (|ic|). This is 
because the g(m) relation does not introduce any additional 
time dependence — recall that the number density of haloes 
in the present model is fixed to the value it has today, and 
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Figure 3. Mean number of bright galaxies as a function of par- 
ent halo mass in the ACDM GIF semianalytic galaxy formation 
model of Kauffmann et al. (1999). Top panel shows the result 
of dividing the sample into two based on colour. Bottom panel 
shows a division based on star formation rate. Crosses, circles, 
squares and triangles are for objects classified as being red, blue, 
quiescent and star-forming galaxies respectively. 



the galaxies are to be thought of simply as marked tracer 
particles within the haloes. 

This has an interesting consequence. Suppose one 
wishes to use the pair conservation equation (hi) to esti- 
mate vf^(r). Then the model above suggests that, on small 
scales, simply inserting the observed galaxy correlation func- 
tion into equation (|l|) should be reasonably accurate. How- 
ever, on larger scales, doing this leads to an estimate of vf 2 
which is incorrect, for the following reason. Because there 
is no time dependence in g(m), the result of doing this has 
the same time dependence as in the case of the dark matter 
for which (/(to) — m, or the case of the haloes for which 
g(m) — 1. But we know that, for the haloes, doing this re- 
sults in the wrong answer, because it neglects the evolution 
of the bias factor. The case of galaxies is no different; if we 
insert the observed galaxy correlation function into the pair 
conservation equation, then we are incorrectly neglecting the 
evolution of the bias factor of the galaxies. This would lead 
one to conclude, incorrectly, that V12 should scale as 6g al 
rather than as fe ga i. 



3.1 Comparison with simulations 

To illustrate how our model works, we will use the g(m) 
relations we obtained from the semianalytic GIF ACDM 
model of Kauffmann et al. (1999) which are now publi- 
cally available. Sheth & Diaferio (2000) provide a fitting 
formula for the g(m) relation of a GIF galaxy catalog 
which was constructed by choosing all galaxies brighter than 
A/v = —17.7 + 5 log ft, after accounting for the effects of 
dust. We divided that catalog up into two subsamples based 
on colour (galaxies labelled as being redder or bluer than 
B — I = 1.8) and on star-formation rate (rates greater or 
less than 2M Q /yr). Fig. | shows these relations. The dashed 
lines show the following fits: 
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Figure 4. Correlation functions of different tracers of the dark 
matter density field in the ACDM GIF semianalytic galaxy for- 
mation model. Filled circles are for the dark matter, crosses are 
for red galaxies, squares for galaxies which have low star forma- 
tion rates, triangles for galaxies with high star formation rates, 
and open circles for blue galaxies. The two solid curves show our 
model predictions for the red and blue galaxies, and the dashed 
curves show what happens if we use the second factorial moment 
of the galaxy counts, rather than the second moment when mak- 
ing our model predicition. For comparison, the dotted curve shows 
the predicted dark matter correlation function. 



+ (mn/30) 



A'bIuc(to) 

iVked (to) 
■WhSFR(m) 
Nl3 F B,(m) 



(l/»ll) 

4[log 10 (mn/6.2)] 2 



= (mn/500) us +0.6e 
= iVA.n(ra) - iVBiue(m) 

= 0.015 + (mn/7000) a9 + e -W°zw(™n/™-2)] 2 



= JV A n(m) - AWr(to) 



(15) 



where mil is the halo mass in units of 10 11 Mq /h. The solid 
lines are the same in both panels; they show ./Vaii(to), and 
they equal the sum of the two dashed lines. These relations 
can be used to compute the statistics of one of these galaxy 
samples, rather than dark matter particles, by setting g(m) 
equal to the appropriate N gs ,\(m) relation. 

In addition to these mean N ga \(m) relations, our models 
also require the second moment of the number of galaxies 
per halo distribution. We have approximated it by setting 
<?2(to) = if g(m) < 1, and 



gi (m) = n 2 (m) g 2 (m) + ng{m) 



ATaii(to) 



(toh/700) 



0.5 e 



-^logioOn/s.e)] 2 



if fl (m) > 1, (16) 

where fi(m) = log 10 [20 <?(m)] 1//2 if g(m) < 5, and = 1 
when g(m) is larger. Because n = 1 for a Poisson distribu- 
tion, this approximately accounts for the fact that the scat- 
ter in galaxy counts is sub-Poisson in low mass haloes. If we 



© 0000 RAS, MNRAS 000, 000-000 



On the streaming motions of haloes and galaxies 7 




r [Mpc//i] r [Mpc//i] 

Figure 5. The mean streaming velocity (left) and pairwise velocity dispersion (right) in the ACDM GIF semianalytic galaxy formation 
model. Filled circles are for the dark matter, crosses are for red galaxies, squares for galaxies which have low star formation rates, triangles 
for galaxies with high star formation rates, and open circles for blue galaxies. The solid and dotted curves show our predictions for the 
red and blue galaxies, and the dark matter, respectively. 



use the same galaxy sample that Scoccimarro et al. (2000) 
did, then our model for the scatter is similar to theirs. 

The correlation functions for these four subsamples are 
shown in Fig. 4: crosses, open circles, triangles and squares 
show £ g ai(r) for red, blue, star-forming and quiescent galax- 
ies in the GIF simulation. Filled circles show the correlation 
function of the dark matter particles. Notice how similar the 
correlation function of the blue sample is to that of the star- 
forming sample, how similar the red and quiescent samples 
are, and how different the blue and star-forming samples are 
from the red and quiescent samples. 

The two solid curves show the result of using our model 
to compute the correlation functions of the red and blue 
samples, because these differ the most from each other. We 
did this by setting g(m) in equation ( |l3| ) equal to the appro- 
priate JV ga i(m) relation. For comparison, the dashed curves 
show the result of using the second factorial moment, rather 
than the second moment, when computing the galaxy corre- 
lation functions; the difference only matters on small scales. 
The dotted curve shows our calculation of ^(r) which has 
g(m) = m. The bottom panel shows how b = y£gai/£dm 
(the ratio of the solid and dotted curves) depends on scale. 
Both panels show that our model provides a good descrip- 
tion of the simulation results. In computing our model pre- 
dictions, we assumed that the two samples both trace their 
parent dark matter haloes similarly. That is, we used the 
same function X(r\m) for both the red and the blue sam- 
ples. If the red galaxies were more centrally concentrated 
than the blue, we could have incorporated it into our anal- 



ysis by adjusting X(r\m). In fact, Fig. 2 of Diaferio et al. 
(1999) shows that, in the semianalytic model, the red galax- 
ies in massive clusters are concentrated more towards the 
centres of their parent halos than the blue ones are. The 
agreement between the simulation and our model curves in 
which we made no such adjustment for this suggests that it 
must amount to only a weak effect. 

We turn, therefore, to the first and second moments 
of the pairwise velocity distribution for these four subsam- 
ples. Fig. ^ shows results for the same semianalytic galaxy 
samples shown in Fig. 4. As before, crosses, squares, trian- 
gles and open circles show galaxies classified as being red, 
quiescent, star-forming, or blue. Filled circles show the cor- 
responding statistics of the dark matter particles. As for 
the correlation functions, the blue and star-forming samples 
are quite different from the red and quiescent samples. Our 
model shows that this arises simply from the fact that these 
samples have rather different N s &i(m) relations — there are 
only a few blue, star forming galaxies in clusters. 

Notice that blue galaxies (circles) have the smallest 
streaming motions, and red galaxies (crosses) have the 
largest V12 values. Our model predicts that larger stream- 
ing motions at large separations indicate a higher amplitude 
of clustering on those scales. Comparison with Fig. 4 shows 
that the correlation functions of the red galaxies are biased 
high relative to the dark matter, whereas the blue galaxies 
are biased low. This is in qualitative agreement with our 
model. 

The solid lines in Fig. [| provide a more quantitative 
comparison between our model predictions for the red and 
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blue galaxies, and the values of the galaxy velocities mea- 
sured in the GIF simulation. The model predictions are 
in reasonable agreement with the simulation, although the 
agreement is certainly not as good as it was for the correla- 
tion functions. Sheth et al. (2000) discuss the reason for the 
overestimate in v\2(r) on large scales (e.g., these models do 
not satisfy the integral constraint). The bottom panels show 
the ratio of the galaxy velocities to those of the dark mat- 
ter; i.e., b vi2 = vf^/vf™, and similarly for &cr 12 . This ratio is 
scale dependent on smaller scales. Our model describes the 
scale dependence reasonably well. 



4 DISCUSSION 

We presented a simple model of how the streaming motions 
of haloes and galaxies depends on separation. We tested the 
model using the publically available simulations of Kauff- 
mann et al. (1999). In the semi-analytic galaxy formation 
model, the mean streaming motions depend rather strongly 
on how the galaxy sample was selected. For example, blue 
galaxies have smaller streaming motions than red galaxies. 
We showed that our model was able to describe the differ- 
ences between a wide range of simulated galaxy catalogues 
rather well (Fig. ^). 

Our model predicts a very close relationship between 
the streaming motions of the galaxies and their spatial dis- 
tribution. Optical and IRAS galaxies cluster differently (e.g. 
Marzke et al. 1995; Fisher et al. 1994). Therefore, they 
must be biased differently relative to the dark matter. If 
the streaming motions of optical galaxies are the same as 
the dark matter, then our model predicts that the stream- 
ing motions of IRAS galaxies must be different from that of 
the dark matter. In other words, whether or not the stream- 
ing motions of a given galaxy sample trace the motions of 
the underlying dark matter depends on how the sample was 
selected. Thus, in the absence of strong arguments for why a 
given galaxy sample is expected to be unbiased, one should 
be cautious when interpretting measurements of the stream- 
ing motions of galaxies. 

If the correlation function and the streaming motions 
of two different galaxy samples have been measured, then 
the model described here (equations [ll] and |lj) says that 
the square root of the ratio of the two correlation functions 
(at, say, 20 Mpc/ft) should equal the ratio of the stream- 
ing motions on the same scale. This can be used to test the 
validity of the model. Again, however, caution is required 
because this relationship is only true on large scales. For the 
semi-analytic galaxy samples we presented, this simple lin- 
ear biasing was a good approximation on scales larger than 
about 10 Mpc/fo (although our model is able to describe 
the scale dependence of this ratio even on smaller scales). In 
this context, we think it worth noting that the semi-analytic 
galaxy samples we presented here cannot explain the results 
of Juszkiewicz et al. (2000) who found that ellipticals and 
spirals have the same value of V12 on separations of about 
10 Mpc//i, even though they estimate that the spatial dis- 
tributions have bias factors which differ by a factor of two: 
6b/6s ~ 2. It will be interesting if future data sets confirm 
this. 

In addition to studying how the mean streaming veloc- 
ity depends on galaxy type, our Fig. [5] also shows that the 



second moment of the pairwise velocity distribution depends 
strongly on galaxy type. For example, our model of the pair- 
wise dispersion suggests that the dispersion of blue galaxies 
should be substantially smaller than that of red ones (al- 
though this difference depends on the colour cut), especially 
on scales of 1 Mpc/h or so. Thus, it is not surprising that avi 
for optical galaxies (Marzke et al. 1995) is almost a factor of 
two larger than for IRAS galaxies (Fisher et al. 1994). The 
effects of redshift space distortions are larger if the pairwise 
dispersion is larger. This means that, on small scales, the 
amplitude of the redshift space correlation function of red 
galaxies should be substantially smaller than the real space 
correlation function, but the difference between real and red- 
shift space correlation functions of blue galaxies should not 
be as dramatic. This is a generic prediction of these sorts of 
galaxy formation models. This suggests that galaxy redshift 
samples cut by colour should provide a useful and direct test 
of these models. 

Dividing a galaxy sample by colour allows another sim- 
ple test of these models. At large separations, where most 
pairs are in separate haloes, the model described above pre- 
dicts that the cross-correlation function of the two colour 
samples should simply be the geometric mean of the two 
individual samples. If the two galaxy samples both trace 
their parent dark matter haloes in the same way (our Fig. 4 
shows that this assumption describes the semianalytic model 
well), then this will be approximately true even on smaller 
scales. (It will not be exactly true because the scatter in 
the iVg a i(m) relations are, typically, sub-Poisson.) Data sets 
currently available should be able to test this prediction. 
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APPENDIX A: THE EVOLUTION OF THE 
CORRELATION FUNCTION 

This Appendix shows how the correlation function and the 
mean streaming motions evolve in our model. 

The correlation function on comoving scale x when the 
expansion factor is a is the sum of two terms: 



/- /„ „\ ^lhalo/ \ i ,^2halo / \ 

4dm (x, a) = £ dm [x, a) + f dm [x, a) 



where 



^2halo/ \ ^Lin / \ 



dm/(m) b(m, a) 



(compare equation |l3|) , and 

m 2 n(m) \(x\m) 



^lhalo/ \ 
£dm (X) 



dm ■ 



(Al) 



(A2) 



(A3) 



P P 

For haloes of virial mass m which have density profiles of 
the form given by Navarro , Frenk & White (1997), 



\{x\m) — 



47T7.3 



(— )> 



(A4) 



where g(y) is given in Appendix A3 of Sheth et al. (2000). 
Here r v i r is the virial radius of the halo in proper physical 
coordinates, x v ^ — r v - 1Y /a is the comoving virial radius, and 
c(m) is a parameter which describes how centrally concen- 
trated the halo profile is: it is the ratio of the virial radius 
to a central core radius, and it depends on the halo mass. 
The normalization term is 



ln(l + c) - c/(l + c) 



(A5) 



The mean streaming motions from pairs in the same halo is 
given by equation ( |1C| ) in the main text. 

We would like to study what happens if we make the 
profile of the region containing m depend on time in the 
following way: we would like to use the same family of pro- 
files, such as the NFW set, to describe the density run at 



any time, and we want to parametrize the evolution of the 
profile shape by changing the values of the profile's parame- 
ters. There is no physical reason why the NFW form should 
describe the density run around a region which has yet to 
virialize; we are only using this to illustrate how our argu- 
ment works. 

For NFW profiles, this means that we will think of 
X(a) = X a as the comoving boundary of the region con- 
taining m at the time when the expansion factor is a: ini- 
tially X(a,i) — Xi, and today X(ao) = x v i T . In addition, we 
will allow the concentration parameter to depend on mass 
as well as time: c(m,a) = c a . The idea is that the density 
run around the centre of the region of radius X a containing 
m was presumably differently concentrated at at than it is 
today. In particular, we would like to see if this model can 
produce the same streaming motions as equation ( |lo| ) in the 
main text. 

To show what is required for this to happen, set 
mn(m)/p = ,f(m), and also set m = AnX 3 po/i, where Xi is 
the initial comoving radius of the halo, and po = p/a 3 is the 
comoving density of the background. Then the expression 
for the correlation function becomes 



^lhalo/ \ 
5dm [X) 



dm/(m) C- 



CaXi\ 3 4> 2 {c a ) 

X a 



( c a x\ 



(A6) 



The volume integral of this is 



«™>*M#)Ml?)' (A7 > 



dm 



where we have defined g(y) = dz z 2 g(z). So 



d In a 



dm f(m) 



ain^ 2 (c a ) (Xi 



din 



+ 



dln{c a /X a ) 3 <j> 2 (c a ) f c a X % 
d In a 3 V X a 



c a x 

XT 



,(A8) 



This will result in the same streaming motions as equa- 
tion (0) if 

91n0 2 (c a ) 



dlna 
dln(c a /X a ) 3 
dlna 



6/n 
3 + n« 

6/n 



and 



3 + n* 



(A9) 



where n* is the slope of the linear power spectrum on the 
scale on which the rms density fluctuation is 1.686, and 
fci = din D(a) /dlna ~ Q 0,6 , where D(a) is the linear the- 
ory growth factor. This sums up what is required of the 
time dependence of the concentration c a and the comoving 
radius X a containing m. Note that these requirements are 
non-trivial, because both c and X depend on the mass m, 
but we are requiring that the derivatives work out to be 
independent of m. 

We can see what this implies for the evolution of the 
correlation f unct ion if we insert the scalings of equation (A9) 
in equation (A6): 



^lhalo/ \ 

5dm (x,a)-- 



dm 



co Xi ^ 3 <£ 2 (co) ( c x/X 



\ £)2/(3+n) 
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Figure Al. The evolution of the mass profile if we require that 
the density profile have the NFW form at all times. For most 
of the comoving volume, the mass in a given comoving shell in- 
creases with time. Only within the core of the object does the 
mass decrease with time. We have truncated the profiles at the 
point where they all enclose the same mass m. 



where the subscript '0' denotes the values of quantities at 
the present time. This shows that the number of pairs on co- 
moving scale x at the present time is the same as the number 
of pairs which, at the earlier time when the linear growth 
factor was D — D(a)/Do were on the larger comoving scale 

x/D 2/{3+ n )_ 

The correlation function decreases monotonically with 
scale, so the expression above implies that ^Jjf 10 was smaller 
at early times than it is today. At very early times, therefore, 
the correlation function mi ght plausibly be dominated by 
the two-halo term (equation A2). The evolution of this term 
can be got from inserting the evolution of the bias factor 
(equation ^) into it. The integrals over m can be done an- 
alytically, with the result that €dS lo (x,a) = D 2 ^(x,a ). 
At sufficiently early times, this two-halo term dominates on 
all scales, so our model for the correlation function reduces 
correctly to the linear theory expression. 

Our requirement that the same NFW form hold at all 
times means that the profile shape evolves as 



Pa(s) 



(ft,)/3 



S {s+(Xa/Ca)/Xi) 2 



Pao(S) 

Pa 



D 3/(3+n t ) 



S + 1/co 



5+£)2/(3+n)/ Co 



(A10) 



where s is the comoving distance from the centre in units 
of the initial comoving scale Xi, whereas S is in units of 
the virial radius at do- The second equality follows from the 
scalings above for the evolution of (j>{ca) and X a /c a , and set- 
ting D = D(a)/Do- The profile evolves in such a way that 
the density on scales s > (X a /c a )/Xi grows as a increases, 
as we expect. On much smaller scales, s <C (X a /c a )/Xi, 
and the profile shape is more like (c a Xi/ X a ) 2 4>{pa) / S tx 
£)-' L /( 3 + n ) /g : on ver y small scales, the density decreases 
with time! 

Because this small scale behaviour of the density seems 
contrary to our intuition, we thought it worth studying the 
evolution of the mass as a function of comoving scale. The 
scalings above imply that the fraction of the total mass m = 



M(s)/M which is in the range ds around s from the halo 
centre when the growth factor is D = D(a)/Do is 



dm(s 
ds 



■ + r>2/(3+n.)/[ CD (A n i/f2)V3] 



(All) 



Fig. Al shows an example of how the mass gets redistributed 
as the profile evolves. It was constructed b y set ting n, = 
— 1.5, Co — 9 and D^/Q, = 180 in equation (All) (the first 
two values approximate those of an m„ halo in a ACDM 
simulation). The solid, dashed, and dot-dashed curves show 



equation (All) at D — 1, 0.75 and 0.5 respectively. For most 
of the volume of the halo, the mass in a given comoving 
shell increases as D increases. Only well within the core of 
the object does it decrease with time. We only show the 
shape of the profile out to the radius X a which contains 
the mass m. At D = 1, X a is the virial radius, which is at 
s = 1/5.6 ~ 0.18; X a was larger earlier, so that the total 
mass contained in the profile remains constant. By D — 0.6 
or so X a > 1, indicating that the model profile must extend 
beyond Xi if it is to enclose mass m; at this point the model 
has really broken down. 

A similar analysis of the Hernquist profile shows the 
same qualitative features: 



Pa{S) _ Pag(S) jj5 / (3+n 



P 



P0 



S + b 



S + b D 2 /( 3 + 



».)) 



(A12) 



Presumably, this apparently unphysical behaviour is a con- 
sequence of our unphysical requirement that the profile have 
the same functional form at all times. 
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